Different approaches to the second order Klein-Gordon equation 
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We derive the Klein-Gordon equation for a single scalar field coupled to gravity at second order 
in perturbation theory and leading order in slow-roll. This is done in two ways: we derive the 
Klein-Gordon equation first using the Einstein field equations, and then directly from the action 
after integrating out the constraint equations. We also point out an unexpected result regarding 
the treatment of the field equations. 
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I. INTRODUCTION 

Cosmological perturbation theory is an essential tool which connects theories of the early universe with observation. 
Until recently it was sufficient to use linear perturbation theory to make and compare theoretical predictions with 
Cosmic Microwave Background (CMB) experiments and data from large scale structure surveys. One of the major 
breakthroughs at first order was the development of the gauge-invariant formalism, initiated by Bardeen and others 
[3, 0, [H, H|. The next round of CMB observations, which will be carried by observatories such as Planck, together 
with data from anisotropics in matter fields such as the neutral hydrogen, will require gauge-invariant second order 
perturbation theory. A vigorous effort is therefore under way to develop higher-order perturbation theory, and change 
it from what was once of mere academic interest and a calculational nuisance, into a powerful tool, see for example 

Refs. 

Considerable progress has already been made. Within the last five years, reliable predictions have become available 
for the deg ree of non-linearity imprinted by inflation in the CMB temperature anisotropy beyond linear order, e.g. [26|, 
H3, El [M M El El El E3 • Indeed, there is already some interesting tension between large classes of theoretically 
well-motivated models and observation on the basis of the non-linearity they predict [35, 36, 37, 38]. For example, one 
of the strongest bounds on the parameter space in the curvaton scenario comes from the limit on the non-gaussianity 
parameter /nl [H, HI IS, S3 • However, despite the considerable effort expended on the development of the theory, 
some interesting surprises can be found in unexpected "corners" . 

Much of this research has been framed in the context of the Lagrangian formalism, where one begins with an action 
and quantises it. In doing so, one encounters a variant of the Feynman diagrams which are used to compute scattering 
amplitudes in particle physics. This method of calculation has the advantage that a good deal of pre-existing knowledge 
can be imported wholesale from high-energy physics. Its principal drawback arises when making comparisons with the 
cosmological literature, which has traditionally approached perturbation theory from the standpoint of the Einstein 
equations. 

In this paper we explore the relationship between these two approaches by returning to the Einstein equations and 
calculating the Klein-Gordon equation at second order in perturbation theory, specialising to the case of a single 
scalar field and working to first order in the slow-roll approximation. This result has already been published in 
Ref. plj in the general case, but here we relate it to a different method of derivation, namely variation of the Einstein 
action coupled to a scalar field. In doing so we highlight a potential pitfall in the reduction of the full Einstein 
equations, which if unobserved can lead to errors at second order. In particular, this means that techniques which 
have been applied successfully at first order [3[ may need treating with more care once one incorporates the effect of 
non-linearities. We obtain consistent results in both cases. 

— 1/2 

Throughout this paper we use natural units where 8irG = M p is set equal to unity, where Mp is the so-called 
reduced Planck mass. Derivatives with respect to conformal time arc denoted by a prime. Greek indices, //., v, A. label 
spacetime coordinates and run from to 3, while lower case Latin indices, k, label purely spatial coordinates and 



run from 1 to 3. The metric convention is (— , +, +, +) and the background spacetime is taken to be of Friedmann- 
Robertson-Walker (FRW) form, 



ds 2 = g^dx^dx" = -dt 2 + a 2 6 ij dx i dx j , (1.1) 

where a = a(t) is the scale factor. It is usually more convenient to work in so-called conformal time, defined by 
r] = J t °° dt/a(t). Note that throughout this paper we work in a spatially flat background spacetime, and evaluate all 
perturbed quantities on uniform curvature hypersurfaces [l8l |. 

The paper is organised as follows. In the following section we define the metric and and the matter content of 
our spacetime. In Section IIIII we derive the Klein-Gordon equation using the field equations and highlight possible 
difficulties involving the spatial part of the second order field equations. In Section HVl we derive the Klein-Gordon 
equation directly from the action. We conclude with a discussion in Section fVl 



II. PRELIMINARIES 



In this section we define the metric and the matter variables we will use in the following sections. 



A. Notation 



We begin by defining the metric tensor in the uniform curvature gauge up to second order in the perturbations, 
including scalar perturbations only, 

9oo = -a 2 (1 + 2^1 + <f> 2 ) , goi = a 2 yB x + ^B-^j , g lJ =a 2 8 i j, (2.1) 

where 0i and (f> 2 together describe the lapse function, and B\ and B 2 comprise the scalar part of the shift function 
(describing the the shear in this gauge). Numerical subscripts denote the order of the perturbation, and a comma 
denotes a partial derivative, that is, X, = dX/dx 1 . The notation used in the present paper coincides with the 
conventions used, for example, in Ref. (2lj . In this notation, any tensorial quantity is expanded in a truncated power 
series, as shown above for the metric tensor (|2.1[) . Similarly the scalar field tp can be split into a background part and 
a perturbation, which we take to be 

(p = tpo + 8ipi + ^6tp 2 , (2.2) 

where tpo is the homogeneous background, 5ipi is chosen to obey Gaussian statistics, and the higher terms 5<p n , <j) n 
and B n for n ^ 2 are polynomials in 5(pi of order n. Note that since we work in the flat gauge, the field fluctuation 
at first and second order is merely the associated Sasaki-Mukhanov variable, [1, HHHi)]. The authors of Refs. [H, 
use a different convention for the perturbations. However, by observing that their field perturbation <50sl is related 
to our perturbations 8(pi and Sip 2 by 5</>sl = <Vi + ^8(p 2 , we can readily compare our results. 



B. Action and energy— momentum tensor 

The action for a single scalar field minimally coupled to standard Einstein gravity is [45l |4£ 



S= /d 4 aV^g f ^ft + Ap ] + [d 3 xVhlC, (2.3) 



where g^ is the spacetime metric, g = det g M „ is its determinant, 1Z is the spacetime Ricci scalar, and d denotes the 
boundary of spacetime, if one exists. If a boundary is present, then hij is its first fundamental form (also referred to 
as the induced metric, obeying hij = gij in the case of the boundary being a hypersurface of constant time), and fdj 
its second fundamental form (or extrinsic curvature), where K, — tr/C^. 
The matter Lagrangian is given by 

C v = -~9*'<P,v<P,M-U(<p), (2.4) 



where ip is the scalar field, U is its potential, and ip^ = dip/dx^. The energy-momentum tensor is obtained by 
varying the action with respect to the metric, giving 



(2.5) 



When the field equations are satisfied, this is automatically conserved as a consequence of Noether's theorem. For a 
single scalar field, T^ v takes the form 



1 



g^U(ip) . 



(2.6) 



III. DERIVING THE SECOND ORDER KLEIN-GORDON EQUATION USING THE FIELD 

EQUATIONS 

We now proceed to derive the governing equation for the scalar field, namely the Klein-Gordon equation, using the 
field equations. We therefore start by writing down Einstein's field equations 

= T^ v , (3--0 

which can be found by varying the action (12. 'i\ . For this purpose the inclusion of the Gibbons-Hawking term on the 
boundary is essential. The Bianchi identities then imply the conservation of energy-momentum, 

WT^ = 0, (3.2) 

where V denotes the covariant derivative. 



A. Field dynamics 

The energy conservation equation, Eq. (|3.2|) . gives an evolution equation for the 8ip n order-by-order (see e.g. Ref. [l8j 
for details). At zeroth order in perturbations, one finds the background equation 

<p% + 2H<p' + a 2 U^ = , (3.3) 

where Ti = a! fa and is related to the Hubble parameter H by Ti = aH. At first order we find 

8<pi' + 2H8cpi' - V 2 <5<^i + 2a 2 U^ 1 - ip' W 2 B 1 - ^<f>[ + cfU^Stpx = , (3.4) 

whereas after some calculation the second order equation becomes 

Sip 2 " + 2HS(p2 - V 2 Stf2 + a 2 U, vv S(p2 + a 2 U \ vvip 5tpi5tpi + 2a 2 [/ ¥ ,0 2 - <Po (^ 2 ^2 + $2) 

+ 4^5 fc B 1 9 fc 1 + 2 (2H<p' + a 2 U^) d k B 1 d k B 1 + 40! (a 2 U tVV 5 Vl - V 2 ^ x ) + Vo&0l 

- 28ipi (V 2 Bi + </)[) - Ad k 5L P i'd k B l = . (3.5) 

In order to arrive at an equation which can reasonably be thought of as a second-order version of the Klein-Gordon 
equation, we must eliminate the terms {Bi, B2, 4>\, ^2} which come from the metric and rewrite them in terms of the 
field fluctuations {Sipi, 6if2}- This is done by using the constraint part of the Einstein equations. Since we will use 
the slow-roll approximation to control this part of the calculation, it is first necessary to decide to what order in the 
approximation our calculations must be carried. 

The slow-roll expansion can be thought of as an expansion in powers of 2e = (ip' /Ti.) 2 , together with other small 
quantities obtained by differentiating e with respect to cosmic time t. It is easy to see from Eq. Q3.5P that all 
intrinsically second order terms (namely, B2 and 4> 2 ) are accompanied by a factor of i^q, whereas some terms formed 
from the product of first order quantities appear without any factors of the time derivative of the background field. 
Therefore we will aim to compute only to (p' /Ti. ~ y/e, for which purpose it is sufficient to obtain second order 
quantities only to zeroth order in slow-roll, but we must carry the calculation of first order quantities to the first 
non-trivial order. 

When e <C 1, one says that the field is in the slow-roll regime. Extracting the leading slow- roll part of the background 
equation, Eq. (13. 3|) . we obtain 



3Htp' + a 2 U. v = 0. 



(3.6) 



B. Einstein equations 



We now turn to the Einstein equations. These break into ten independent equations, of which some are evolution 
equations and some are constraints. In the present case there is only one degree of freedom, the field fluctuation 
{Sifi, 5if2}, and therefore there will ultimately be only one non-trivial evolution equation. In this section we aim to 
use the remaining equations, which are constraints, to obtain the metric quantities {-Bi, -B2, <j>i, $2} hi terms of the 
gaussian field fluctuation, 5ipi. 

1. Background 

At zeroth order in slow roll the background Einstein equations imply the Friedmann equation, which simplifies to 

H 2 = ^a 2 U . (3.7) 
This gives an evolution equation for H, namely H' = H 2 . 

2. First order 

Now consider the terms which are first order in perturbations. The (0, 0) component of the Einstein equations is 

2a 2 U <j) l + p'o<W + a 2 8V x + 2HV 2 B l = , (3.8) 

and the (0, i) part gives 

fl| i (2H0 1 -^i)=O, (3.9) 

where the gradient can readily be removed, e.g. by working Fourier space. Working to first order in slow roll and 
using the background equations given above, we find that Eq. (|3.8p can be rewritten as 

tp'oSipi +2HV 2 B 1 = 0. (3.10) 

This is a constraint which allows us to eliminate B\ from Eq. Q3.5P in favour of of the field fluctuation 5(fi. 
The component of the Einstein equation is given by 

(2W0i - (p'oSip! + 2a 2 U <f>i + a 2 6Ui) % + (V% - didj) (B[ + 2HB 1 + t ) = 0. (3.11) 

Taking the trace of Eq. (|3.1ip gives one scalar equation, 

3 (2H<i>i - <Po8<Pi + 2a 2 C/ o 0i + a 2 <5[/i) + 2V 2 (B[ + 2HB l + ^) = . (3.12) 

At linear order we have now two options to "extract" another scalar equations from Eq. (|3.11|) : 

• It is possible to simply read off the off-diagonal (or i ^ j) part of the equation, Eq. (|3.11[) . This method was 
used by Mukhanov, Feldman & Brandenberger [3j at first order, in conjunction with invariance arguments which 
allowed them to extend the three independent i ^ j equations to a single scalar equation which constrained the 
i =/= j term to vanish. 

• Another possibility, advocated here, is to apply the operator d l d^ to Eq. p. lip . The benefit in doing so is that 
one arrives at a scalar equation via manipulations which are manifestly tensorial. Observe that the operator 
V 2 8ij — didj is transverse to d l d^ and therefore vanishes, leaving an evolution equation for <f>i. 

Although both methods yield valid results at first order, it is not clear how the former should be extended to 
second order. On the other hand, the second method requires only textbook manipulations, leading to a correct 
scalar equation via an essentially mechanical process. Its principal disadvantage is the necessity to introduce inverse 
Laplacian operators. In either case, one can extract an evolution equation for the shear, 

W 2 (B[ + 2H5i + 0i) = 0, (3.13) 



and an evolution equation for the lapse function 

2U^ X + a 2 W x + 2a 2 U a fa - <p' 6cpi = . (3.14) 

After dropping terms which are higher order in slow-roll this simplifies substantially, yielding 

2W0' X - y>o<W = , (3.15) 

Any combination of Eq. (|3.10[) with Eqs. (|3 . 1 3[) or (|3 . 1 5[) allows us to determine fa in terms of Sipi. We therefore 
have both metric potentials which are necessary to rewrite Eq. Q3.4p as a single equation for Sipi. 



3. Second order 

As pointed out in Section IIII Al above we only require the second order field equations to zeroth order in slow 
roll. Reducing the calculation to the leading order slow-roll effect considerably simplifies both the calculation and 
presentation of our results. 

The (0, i) Einstein equation is given by 

d-Kfa - Sn'BtSip! = (3.16) 

This can be rewritten as a scalar equation by taking its divergence, yielding 

Ufa ~ V- 2 (Syi'VSipx + dkStfi&Sipx) = , (3.17) 

where we introduce the inverse Laplacian, which satisfies the identity V _2 (V 2 )A = X . 
The second order (i, j)-equation is given by 

{2H<j>' 2 + 2a 2 U fa) 5 l3 + (V 2 % - $0,-) (B' 2 + 2HB 2 + fa) + (d k S(p 1 d h ! <Vi - fy/ 2 ) % - 2d l 5 Vl d J S^ 1 = . (3.18) 

The spatial trace is obtained from Eq. (|3 . 1 8|) and satisfies 

?,H4>' 2 + 3a 2 U fa + V 2 (B 2 + 2HB 2 + fa) + - (-3^/ 2 + d k 8y x d k 8y^ = . (3.19) 

Following the discussion above, we now let didj act on Eq. (|3.18[) to extract another scalar equation from the (i,j) 
part, and observe that d l d : > (V 2 ^ — didj) X = 0. This gives 

W02+3HV" 2 (<Vi'V 2 <5<^i + d k S(p 1 'd k 5ip 1 )-V- 2 (V 2 <ViV 2 (5(pi + Sipi^Sipi + d k 5ip 1 V 2 d k 5<p 1 + d k Sip 1 'd k Sip 1 / ) = . 

(3.20) 

Note that this usefulness of this procedure is not specific to our choice of gauge or dependent on invoking the slow 
roll approximation. We believe it is simplest to employ our method to extract a scalar equation from the (i,j) part 
of the Einstein equation, (|3.18p . whenever the equation can not be written as a total derivative. This is in general 
the case at second order and higher, when terms of the form XuYu etc. will appear. 

How can we be confident that we are proposing the correct method? The first reason is that only this approach 
leads to a consistent set of Einstein equations. The other is that we get the same result for the Klein-Gordon equation 
using the action method, as shown in Section HVl 

Since equation l|3.20p is just an evolution equation for the second order lapse function, another way of deriving it, 
is to take the time derivative of the constraint (|3.17p . In this case we also need the first order Klein-Gordon equation 
in the extreme slow roll limit, which follows from Eq. (13. 4p as 

Stpi' + mSipi- V 2 6ip! =0. (3.21) 



Substituting Eq. (|3.2ip into the time derivative of Eq. (|3.17p . we recover Eq. (|3.20p . as required. 



C. The Klein—Gordon equation at second order 



We can now substitute the field equations at first and second order into the Klein-Gordon equation (|3.5|) . Working 
to leading order in slow roll we get 



-rri ~ f<W 2 + dkdip^Sipx) - 25ip 1 V 2 5ip 1 - V~ 2 \v 2 S^V 2 8^ + 8ipi'V 2 8ipi' 



8ip 2 + 2H8ip 2 ' - V 2 S(p 2 + a 2 U }VVV 5ipi 

+ H l ~ 

+d k S( Pl V 2 d k 6<p 1 + d k 5ip 1 'd k 5tp 1 ' + 2d k S(p 1 'V- 2 d k 6<p 1 '^ = . (3.22) 

When applying the slow-roll expansion, one ordinarily thinks of the higher derivatives of the potential, such as U^ VVVl 
as being negligible in comparison with the leading slow-roll terms. In Eq. (|3.22p this term has been retained. This 
is because although U. v /U 1 is necessary in order for inflation to occur at all, and U. vv /U <C 1 is necessary in 
order to have sufficient e-foldings, there are no such constraints for higher derivatives. Therefore it may well be that 
models exist in which U. vvv is unusually large. In such a model, U. vvv will make a contribution to the non-gaussianity 
imprinted in the cosmic microwave background which cannot be ignored [47| . 

IV. DERIVATION FROM THE THIRD ORDER ACTION 

Let us now return to the action and derive the Klein-Gordon equation from it. This follows a somewhat different 
procedure than that used with the field equations, owing to the way constraints are implemented in the action 
formalism. 

The action is found to depend only algebraically on the lapse and shift functions and their spatial derivatives; it 
involves no time derivatives of these quantities. Since one finds the equation of motion for each field, say W, by 
varying the action with respect to W and demanding that the resulting variation SS/8W is zero, the equations of 
motion for the lapse <j> and the shift B are also purely algebraic. They can be solved to give expressions in terms of 
the field fluctuation which hold at all times and do not require integrating any equation of motion. Once <p and B 
have been determined, they can be eliminated from the Lagrangian. Varying the resulting action with respect to the 
field gives the equation of motion directly, which must coincide with the Klein-Gordon e quat ion derived above. 

This calculation was initially given (in the flat and comoving slicings) by Maldacena [26j and refined in the flat 
slicing in Ref . [29] . In conformal time, the part of the action quadratic in S(fi2 can be written 



S 2 



~ J d V d 3 x a 2 [{5^ 2 ') 2 - (dS<p 2 ) 2 }. (4.1) 



On the other hand, there is also an "interaction" term which involves a product of the 6(p n . The leading term in this 
interaction is linear in 5<f2 and quadratic in Sipi, 



drj d x a 



^U vvv 5v2(5vi) 2 + || {<Wd fe V- 2 <W^i - [(Sipx'f + (dS^i) 2 ] 



permutations, 



(4.2) 

where the permutations are formed by swapping the position of S(f2 among the three possible locations. The higher 
derivative C/, vvv has been included to account for the possibility that it is unusually large, as discussed above. The 
field equation for 6(f2 is obtained by demanding that 5S/S(Sif2) — 0, ignoring any boundary terms. 
Consider first S*2. It is clear that 8S2I 'S(Sip 2 ) can be written 

5S 2 = jJ d^d 3 ^ 8{8ip 2 ) a 2 (-5<p 2 " - 2H5<p 2 ' + V 2 ^ 2 ) + ^ ^ d 3 x a 2 £(<5^ 2 )^ 2 '. (4.3) 

Now consider S3. From left to right this breaks into four terms, the first of which is the variation of the U^ vvv term 



and is trivial. The variation of the second term gives (including the effect of permuting the location of Sip 2) 



5S 3 D / d 3 a; a 2 ^-S{Sp 2 ) {^V" 2 Sipt' d k Sip x - V~ 2 [d k Sipx' d k Sip! + Sip^Sip^ } 
Jd 4/ l 



(v 2 <Vi + d k Sp 1 d k ) 



5S 2 



{Sipi 



+ J d v d 3 x a 2 ^S(Sip 2 )l V" 2 [V 2 ^iV 2 ^i + d k Sip l d k V 2 Sip 1 + d k Sip 1 'd k Sip 1 ' + Sipi^Sipi] 

- d k Sp 1 d k Sp 1 - 23 k Sp 1 'd k V- 2 Sp 1 ' - {Sipx'f I . (4.4) 

The terms proportional to 5S 2 /S(Sipi) vanish when they are evaluated on the classical solution, which is obtained 
by demanding that Eq. (|4.3|) is zero. For this purpose one ignores the surface term, choosing boundary conditions 
for S(Sipi) such that it vanishes. If one chooses to retain the surface term then the corresponding surface terms in 
Eq. (|4.4| are eliminated, but any gain is illusory since they must then be added back in from the equation of motion 
for Sipi. 

The third term in Eq. (|4.2p gives a variation 



\ 3 xa 2 ^ 

m 

and, likewise, the fourth term contributes 



SS 3 D I d 3 x a 2 ^-5(Sp 2 ) {-<Vi<W} + / dnd 3 * a 2 ^-5(5p 2 ) , 
J d 4H 'J m. \ a 2 



Sipi J^ 2 N + SpiV 2 Sipi + -(Sipi ) 2 



S(Sipi) 



5S 3 D J d v d 3 x a 2 ^5(5p 2 )\^d k 5p 1 d k 5p 1 +5p 1 V 2 5p 1 ^ 



(4.5) 



(4.6) 



These terms all combine to give the total variation. Discarding those terms proportional to SS 2 /S(Sipi) together with 
all surface terms, one finds the overall Klein-Gordon equation 

Sip 2 " + 2H5ip 2 - V 2 Sip 2 + U }ipipv (5ipi) 2 = 

+ H i ~ \ dk6 ^ dk5 ^ ~ \( s <Pi? + 2^iV 2 ^! - 2d k S^ l 'd k V- 2 8p 1 ' 

+ V- 2 [V 2 ^iV 2 (5(pi + d k Sip 1 d k V 2 Sip 1 + d k 8ip 1 'd k 8ip 1 ' + Sip 1 'V 2 Sip 1 '] |. (4.7) 

Equation (|4.7[l agrees exactly with the Klein-Gordon equation which was found using the Einstein equations, 
Eq. J322}. 



V. DISCUSSION 



In this paper, we have derived the Klein-Gordon equation at second order in the perturbations and leading order in 
slow-roll. We have shown explicitly for the first time that the two most popular approaches in the literature, based on 
the Einstein field equations and the action principle, yield equivalent results. This goes some way to demonstrating 
the consistency of current beyond-leading-order calculations, including those of the non-linearity parameters /nl and 
tnl [H, US El which will be important for future CMB experiments. 

In many cases one can choose freely whether the action principle or the Einstein field equations are more appropriate 
for the task in hand. In the case of the Einstein equations, one needs the relevant constraint equations at nth order, 
and an evolution equation. In principle this is contained in the Einstein equations themselves, but in practise it is 
often more convenient to obtain an evolution equation from the Bianchi identities, as we have done in the present 
paper. For the action, one must compute the Lagrangian to (n + l)th order, and then eliminate the constraints 
associated with the lapse function and shift vector. However, owing to the precise way these appear in the action, 
there is a special simplification which implies one need obtain explicit expressions only to (n — l)th order. The result, 
at least for low orders of perturbation theory, is that the amount of work required in both approaches seems much 
the same. 

These two methods are traditionally used for different purposes. The field equations are ordinarily the method 
of choice for obtaining numerical solutions. On the other hand, the action is typically preferred when one aims to 



compute scattering amplitudes, or more generally the correlation functions of a quantised theory. This is because 
the action, being a scalar, manifestly exhibits the full group of symmetries associated with the theory. Moreover, it 
facilitates the use of the so-called "interaction picture" which is the most common approach for dealing with fields 
beyond first order in perturbation theory. 

In some cases, however, there is no choice and one must work with the equations of motion. For example, this is 
the case with dissipative systems such as fluids for which it is difficult or impossible to formulate an action principle. 
In this case one would like to extend the existing second-order framework to allow the computation of quantities 
traditionally associated with the action, such as the correlation functions of the quantised theory. The analysis given 
in this paper is a first step in that direction. 

We have pointed out that some care is required in reducing the full set of Einstein equations. In general this 
necessitates contracting spatial tensors into scalar equations by the application of a suitable derivative operator. 
Together with the spatial trace, this gives two scalar equations which at first order coincide with the well-known 
standard results Q. At second order, the analysis requires some more care in order to obtain correct results. 
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